智能论文笔记

Scalable Bayesian Optimization Using Deep Neural Networks

Jasper Snoek , Oren Rippel , Kevin Swersky , Ryan Kiros , Nadathur Satish , Narayanan Sundaram , Md. Mostofa Ali Patwary , Prabhat , Ryan P. Adams

分类：

2015-02-19

Bayesian optimization is an effective methodology for the global optimization of functions with expensive evaluations. It relies on querying a distribution over functions defined by a relatively cheap surrogate model. An accurate model for this distribution over functions is critical to the effectiveness of the approach, and is typically fit using Gaussian processes (GPs). However, since GPs scale cubically with the number of observations, it has been challenging to handle objectives whose optimization requires many evaluations, and as such, massively parallelizing the optimization.In this work, we explore the use of neural networks as an alternative to GPs to model distributions over functions. We show that performing adaptive basis function regression with a neural network as the parametric form performs competitively with state-of-the-art GP-based approaches, but scales linearly with the number of data rather than cubically. This allows us to achieve a previously intractable degree of parallelism, which we apply to large scale hyperparameter optimization, rapidly finding competitive models on benchmark object recognition tasks using convolutional networks, and image caption generation using neural language models.

translated by 谷歌翻译

Factuality Enhanced Language Models for Open-Ended Text Generation

Nayeon Lee , Wei Ping , Peng Xu , Mostofa Patwary , Mohammad Shoeybi , Bryan Catanzaro

分类：自然语言处理 | 人工智能 | 机器学习

2022-06-09

预审前的语言模型（LMS）易于生成具有非事实信息的文本。在这项工作中，我们测量并提高了开放式文本生成的大规模LMS的事实准确性。我们设计了FactualityPrompts测试集和指标，以衡量LM世代的事实。基于此，我们研究了参数尺寸范围从126m到530b不等的LMS的事实准确性。有趣的是，我们发现较大的LM比较小的LM更为事实，尽管先前的研究表明，在误解方面较大的LMS可能不太真实。此外，开放式文本生成中流行的采样算法（例如，顶级P）可能会损害由于每个采样步骤中引入的“均匀随机性”，因此损害了事实。我们提出的事实核采样算法会动态适应随机性，以改善发电的事实，同时保持质量。此外，我们分析了从事实文本语料库（例如Wikipedia）学习实体之间正确关联的标准培训方法的效率低下。我们提出了一种事实增强的培训方法，该方法使用topicprefix更好地意识到事实和句子完成作为培训目标，这可以大大减少事实错误。

translated by 谷歌翻译

Performance Analysis of YOLO-based Architectures for Vehicle Detection from Traffic Images in Bangladesh

Refaat Mohammad Alamgir , Ali Abir Shuvro , Mueeze Al Mushabbir , Mohammed Ashfaq Raiyan , Nusrat Jahan Rani , Md. Mushfiqur Rahman , Md. Hasanul Kabir , Sabbir Ahmed

分类：计算机视觉

2022-12-18

The task of locating and classifying different types of vehicles has become a vital element in numerous applications of automation and intelligent systems ranging from traffic surveillance to vehicle identification and many more. In recent times, Deep Learning models have been dominating the field of vehicle detection. Yet, Bangladeshi vehicle detection has remained a relatively unexplored area. One of the main goals of vehicle detection is its real-time application, where `You Only Look Once' (YOLO) models have proven to be the most effective architecture. In this work, intending to find the best-suited YOLO architecture for fast and accurate vehicle detection from traffic images in Bangladesh, we have conducted a performance analysis of different variants of the YOLO-based architectures such as YOLOV3, YOLOV5s, and YOLOV5x. The models were trained on a dataset containing 7390 images belonging to 21 types of vehicles comprising samples from the DhakaAI dataset, the Poribohon-BD dataset, and our self-collected images. After thorough quantitative and qualitative analysis, we found the YOLOV5x variant to be the best-suited model, performing better than YOLOv3 and YOLOv5s models respectively by 7 & 4 percent in mAP, and 12 & 8.5 percent in terms of Accuracy.

translated by 谷歌翻译

Lateral Strain Imaging using Self-supervised and Physically Inspired Constraints in Unsupervised Regularized Elastography

Ali K. Z. Tehrani , Md Ashikuzzaman , Hassan Rivaz

分类：计算机视觉

2022-12-16

Convolutional Neural Networks (CNN) have shown promising results for displacement estimation in UltraSound Elastography (USE). Many modifications have been proposed to improve the displacement estimation of CNNs for USE in the axial direction. However, the lateral strain, which is essential in several downstream tasks such as the inverse problem of elasticity imaging, remains a challenge. The lateral strain estimation is complicated since the motion and the sampling frequency in this direction are substantially lower than the axial one, and a lack of carrier signal in this direction. In computer vision applications, the axial and the lateral motions are independent. In contrast, the tissue motion pattern in USE is governed by laws of physics which link the axial and lateral displacements. In this paper, inspired by Hooke's law, we first propose Physically Inspired ConsTraint for Unsupervised Regularized Elastography (PICTURE), where we impose a constraint on the Effective Poisson's ratio (EPR) to improve the lateral strain estimation. In the next step, we propose self-supervised PICTURE (sPICTURE) to further enhance the strain image estimation. Extensive experiments on simulation, experimental phantom and in vivo data demonstrate that the proposed methods estimate accurate axial and lateral strain maps.

translated by 谷歌翻译

A Dependable Hybrid Machine Learning Model for Network Intrusion Detection

Md. Alamin Talukder , Khondokar Fida Hasan , Md. Manowarul Islam , Md Ashraf Uddin , Arnisha Akhter , Mohammand Abu Yousuf , Fares Alharbi , Mohammad Ali Moni

分类：机器学习

2022-12-08

Network intrusion detection systems (NIDSs) play an important role in computer network security. There are several detection mechanisms where anomaly-based automated detection outperforms others significantly. Amid the sophistication and growing number of attacks, dealing with large amounts of data is a recognized issue in the development of anomaly-based NIDS. However, do current models meet the needs of today's networks in terms of required accuracy and dependability? In this research, we propose a new hybrid model that combines machine learning and deep learning to increase detection rates while securing dependability. Our proposed method ensures efficient pre-processing by combining SMOTE for data balancing and XGBoost for feature selection. We compared our developed method to various machine learning and deep learning algorithms to find a more efficient algorithm to implement in the pipeline. Furthermore, we chose the most effective model for network intrusion based on a set of benchmarked performance analysis criteria. Our method produces excellent results when tested on two datasets, KDDCUP'99 and CIC-MalMem-2022, with an accuracy of 99.99% and 100% for KDDCUP'99 and CIC-MalMem-2022, respectively, and no overfitting or Type-1 and Type-2 issues.

translated by 谷歌翻译

BSpell: A CNN-blended BERT Based Bengali Spell Checker

Chowdhury Rafeed Rahman , MD. Hasibur Rahman , Samiha Zakir , Mohammad Rafsan , Mohammed Eunus Ali

分类：自然语言处理

2022-08-20

孟加拉语键入大多是使用英语键盘进行的，并且由于存在化合物和类似明显的字母，因此可能是错误的。拼写错误的单词的拼写校正需要了解单词键入模式以及用法一词的上下文。我们提出了一个专业的BERT模型，Bspell针对词校正句子级别。Bspell包含一个可训练的CNN子模型，名为Semanticnet以及专门的辅助损失。这使得Bspell在存在拼写错误的情况下专门研究高度易转的孟加拉词汇。我们进一步提出了将单词级别和字符水平掩蔽组合的混合预读方案。利用这种预审前的方案，BSPELL在现实生活中的孟加拉语拼写校正验证设置中实现了91.5％的准确性。对两个孟加拉语和一个印地语拼写校正数据集进行了详细比较，显示了拟议的Bspell优于现有咒语检查器的优势。

translated by 谷歌翻译

Monkeypox Skin Lesion Detection Using Deep Learning Models: A Feasibility Study

Shams Nafisa Ali , Md. Tazuddin Ahmed , Joydip Paul , Tasnim Jahan , S. M. Sakeef Sani , Nawsabah Noor , Taufiq Hasan

分类：计算机视觉 | 人工智能

2022-07-06

由于其在非洲以外的40多个国家 /地区的迅速传播，最近的蒙基托克斯爆发已成为公共卫生问题。由于与水痘和麻疹的相似之处，蒙基托斯在早期的临床诊断是具有挑战性的。如果不容易获得验证性聚合酶链反应（PCR）测试，那么计算机辅助检测蒙基氧基病变可能对可疑病例的监视和快速鉴定有益。只要有足够的训练示例，深度学习方法在自动检测皮肤病变中有效。但是，截至目前，此类数据集尚未用于猴蛋白酶疾病。在当前的研究中，我们首先开发``Monkeypox皮肤病变数据集（MSLD）。用于增加样本量，并建立了3倍的交叉验证实验。在下一步中，采用了几种预训练的深度学习模型，即VGG-16，Resnet50和InceptionV3用于对Monkeypox和Monkeypox和Monkeypox和其他疾病。还开发了三种型号的合奏。RESNET50达到了82.96美元（\ pm4.57 \％）$的最佳总体准确性，而VGG16和整体系统的准确性达到了81.48美元（\ pm6.87 \％）$和$ 79.26（\ pm1.05 \％）$。还开发了一个原型网络应用程序作为在线蒙基蛋白筛选工具。虽然该有限数据集的初始结果是有希望的，但需要更大的人口统计学多样化的数据集来进一步增强性增强性。这些的普遍性楷模。

translated by 谷歌翻译

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava , Abhinav Rastogi , Abhishek Rao , Abu Awal Md Shoeb , Abubakar Abid , Adam Fisch , Adam R. Brown , Adam Santoro , Aditya Gupta , Adrià Garriga-Alonso

分类：自然语言处理 | 人工智能 | 机器学习 | (统计)机器学习

2022-06-09

语言模型既展示了定量的改进，又展示了新的定性功能，随着规模的增加。尽管它们具有潜在的变革性影响，但这些新能力的特征却很差。为了为未来的研究提供信息，为破坏性的新模型能力做准备，并改善社会有害的效果，至关重要的是，我们必须了解目前和近乎未来的能力和语言模型的局限性。为了应对这一挑战，我们介绍了超越模仿游戏基准（Big Bench）。 Big Bench目前由204个任务组成，由132家机构的442位作者贡献。任务主题是多样的，从语言学，儿童发展，数学，常识性推理，生物学，物理学，社会偏见，软件开发等等。 Big-Bench专注于被认为超出当前语言模型的功能的任务。我们评估了OpenAI的GPT型号，Google内部密集变压器体系结构和大型基础上的开关稀疏变压器的行为，跨越了数百万到数十亿个参数。此外，一个人类专家评估者团队执行了所有任务，以提供强大的基准。研究结果包括：模型性能和校准都随规模改善，但绝对的术语（以及与评估者的性能相比）；在模型类中的性能非常相似，尽管带有稀疏性。逐渐和预测的任务通常涉及大量知识或记忆成分，而在临界规模上表现出“突破性”行为的任务通常涉及多个步骤或组成部分或脆性指标；社交偏见通常会随着含糊不清的环境而随着规模而增加，但这可以通过提示来改善。

translated by 谷歌翻译

LEMON: LanguagE ModeL for Negative Sampling of Knowledge Graph Embeddings

Md Rashad Al Hasan Rony , Mirza Mohtashim Alam , Semab Ali , Jens Lehmann , Sahar Vahdati

分类：人工智能 | 机器学习

2022-03-09

知识图嵌入模型已成为机器学习的重要领域。这些模型在知识图中提供了实体和关系的潜在表示，然后可以在下游机器学习任务（例如链接预测）中使用。这些模型的学习过程可以通过对比正面和负三元组来执行。虽然所有千克的三元组都被认为是正的，但负三元三联通常不容易获得。因此，获得的采样方法的选择在知识图嵌入模型的性能和有效性中起着至关重要的作用。当前的大多数方法从基础知识图中实体的随机分布中获取负面样本，这些样本通常还包括毫无意义的三元组。其他已知方法使用对抗技术或生成神经网络，从而降低了过程的效率。在本文中，我们提出了一种方法，以产生有关实体的可用互补知识的信息负面样本。特别是，预训练的语言模型用于通过利用实体之间的距离来形成邻里群集，以通过其文本信息获得符号实体的表示。我们的全面评估证明了拟议方法在基准知识图上具有链接预测任务的文本信息的有效性。

translated by 谷歌翻译

HyperSegNAS: Bridging One-Shot Neural Architecture Search with 3D Medical Image Segmentation using HyperNet

Cheng Peng , Andriy Myronenko , Ali Hatamizadeh , Vish Nath , Md Mahfuzur Rahman Siddiquee , Yufan He , Daguang Xu , Rama Chellappa , Dong Yang

分类：计算机视觉

2021-12-20

由于物体形状和图案（例如器官或肿瘤）的高可变性，3D医学图像的语义分割是一个具有挑战性的任务。鉴于最近在医学图像分割中深入学习的成功，已经引入了神经结构搜索（NAS）以查找高性能3D分段网络架构。但是，由于3D数据的大量计算要求和架构搜索的离散优化性质，之前的NAS方法需要很长的搜索时间或必要的连续放松，并且通常导致次优网络架构。虽然单次NAS可能会解决这些缺点，但其在分段域中的应用尚未在膨胀的多尺度多路径搜索空间中进行很好地研究。为了为医学图像分割启用一次性NAS，我们的方法名为Hypersegnas，介绍了通过结合建筑拓扑信息来帮助超级培训培训。在培训超级网络培训并在架构搜索期间引入开销时，可以删除这种超空头。我们表明，与以前的最先进的（SOTA）分割网络相比，Hypersegnas产生更好的表现和更直观的架构;此外，它可以在不同的计算限制下快速准确地找到良好的体系结构候选者。我们的方法是在医疗细分Decovaton（MSD）挑战的公共数据集上评估，并实现了SOTA表演。

translated by 谷歌翻译